Divide-and-conquer based large-scale spectral clustering
نویسندگان
چکیده
Spectral clustering is one of the most popular methods. However, how to balance efficiency and effectiveness large-scale spectral with limited computing resources has not been properly solved for a long time. In this paper, we propose divide-and-conquer based method strike good between effectiveness. proposed method, landmark selection algorithm novel approximate similarity matrix approach are designed construct sparse within low computational complexities. Then results can be computed quickly through bipartite graph partition process. The achieves lower complexity than existing Experimental on ten datasets have demonstrated method. MATLAB code experimental available at https://github.com/Li-Hongmin/MyPaperWithCode.
منابع مشابه
Spectral clustering for divide-and-conquer graph matching
We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures. We justify our approach by proving that modestly correlated, large stochastic block model random graphs are correctly matched utilizing very few seeds through...
متن کاملCorrelation clustering: divide and conquer
The correlation clustering is an NP-hard problem, hence its solving methods do not scale well. The contraction method and its improvement enable us to construct a divide and conquer algorithm, which could help us to clustering bigger sets. In this article we present the contraction method and compare the effectiveness of this new new and our old methods.
متن کاملReduced Complexity Divide and Conquer Algorithm for Large Scale TSPs
The Traveling Salesman Problem (TSP) is the problem of finding the shortest path passing through all given cities while only passing by each city once and finishing at the same starting city. This problem has NP-hard complexity making it extremely impractical to get the most optimal path even for problems as small as 20 cities since the number of permutations becomes too high. Many heuristic me...
متن کاملApplying Divide and Conquer to Large Scale Pattern Recognition Tasks
Rather than presenting a speciic trick, this paper aims at providing a methodology for large scale, real-world classiication tasks involving thousands of classes and millions of training patterns. Such problems arise in speech recognition, handwriting recognition and speaker or writer identiication, just to name a few. Given the typically very large number of classes to be distinguished, many a...
متن کاملA Novel K means Clustering Algorithm for Large Datasets Based on Divide and Conquer Technique
In this paper we propose an efficient algorithm that is based on divide and conquers technique for clustering the large datasets. In our research work we have applied divide and conquer technique on partitions of the large datasets and we have used squared Euclidean distance for measuring the similarity between data points. The partitioning of datasets is done according to the number of cluster...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2022
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2022.06.006